Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 40395 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 14.9 MiB |
| Average record size in memory | 387.5 B |
Variable types
| NUM | 8 |
|---|---|
| BOOL | 6 |
| CAT | 5 |
city_name has a high cardinality: 1046 distinct values | High cardinality |
region is highly correlated with province | High correlation |
province is highly correlated with region | High correlation |
surface_of_the_land is highly skewed (γ1 = 53.15034165) | Skewed |
df_index has unique values | Unique |
surface_of_the_land has 20751 (51.4%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-18 09:28:10.524422 |
|---|---|
| Analysis finished | 2020-09-18 09:28:28.824419 |
| Duration | 18.3 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 40395 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25808.86065 |
|---|---|
| Minimum | 0 |
| Maximum | 52075 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2572.7 |
| Q1 | 12715.5 |
| median | 25666 |
| Q3 | 38836 |
| 95-th percentile | 49437.3 |
| Maximum | 52075 |
| Range | 52075 |
| Interquartile range (IQR) | 26120.5 |
Descriptive statistics
| Standard deviation | 15083.3038 |
|---|---|
| Coefficient of variation (CV) | 0.5844234663 |
| Kurtosis | -1.208587665 |
| Mean | 25808.86065 |
| Median Absolute Deviation (MAD) | 13087 |
| Skewness | 0.01817640765 |
| Sum | 1042548926 |
| Variance | 227506053.6 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 11727 | 1 | < 0.1% | |
| 21984 | 1 | < 0.1% | |
| 42462 | 1 | < 0.1% | |
| 48605 | 1 | < 0.1% | |
| 36315 | 1 | < 0.1% | |
| 34266 | 1 | < 0.1% | |
| 40409 | 1 | < 0.1% | |
| 38360 | 1 | < 0.1% | |
| 9678 | 1 | < 0.1% | |
| 19939 | 1 | < 0.1% | |
| 15821 | 1 | < 0.1% | |
| 13772 | 1 | < 0.1% | |
| 3531 | 1 | < 0.1% | |
| 1482 | 1 | < 0.1% | |
| 7625 | 1 | < 0.1% | |
| 5576 | 1 | < 0.1% | |
| 26054 | 1 | < 0.1% | |
| 24033 | 1 | < 0.1% | |
| 32229 | 1 | < 0.1% | |
| 26118 | 1 | < 0.1% | |
| 34298 | 1 | < 0.1% | |
| 30212 | 1 | < 0.1% | |
| 17922 | 1 | < 0.1% | |
| 24065 | 1 | < 0.1% | |
| Other values (40370) | 40370 | 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 11 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 52075 | 1 | < 0.1% | |
| 52073 | 1 | < 0.1% | |
| 52072 | 1 | < 0.1% | |
| 52071 | 1 | < 0.1% | |
| 52070 | 1 | < 0.1% | |
| 52068 | 1 | < 0.1% | |
| 52067 | 1 | < 0.1% | |
| 52065 | 1 | < 0.1% | |
| 52064 | 1 | < 0.1% | |
| 52063 | 1 | < 0.1% |
postal_code
Real number (ℝ≥0)
| Distinct | 1057 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5195.044139 |
|---|---|
| Minimum | 1000 |
| Maximum | 9992 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1080 |
| Q1 | 2360 |
| median | 4630 |
| Q3 | 8400 |
| 95-th percentile | 9420 |
| Maximum | 9992 |
| Range | 8992 |
| Interquartile range (IQR) | 6040 |
Descriptive statistics
| Standard deviation | 2979.185308 |
|---|---|
| Coefficient of variation (CV) | 0.5734667942 |
| Kurtosis | -1.517977446 |
| Mean | 5195.044139 |
| Median Absolute Deviation (MAD) | 2845 |
| Skewness | 0.08975300772 |
| Sum | 209853808 |
| Variance | 8875545.1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 8300 | 750 | 1.9% | |
| 8400 | 685 | 1.7% | |
| 9000 | 634 | 1.6% | |
| 1180 | 498 | 1.2% | |
| 1000 | 451 | 1.1% | |
| 8370 | 447 | 1.1% | |
| 4000 | 394 | 1.0% | |
| 8670 | 365 | 0.9% | |
| 2000 | 323 | 0.8% | |
| 1050 | 323 | 0.8% | |
| 1070 | 301 | 0.7% | |
| 1030 | 300 | 0.7% | |
| 9300 | 289 | 0.7% | |
| 8620 | 285 | 0.7% | |
| 3500 | 279 | 0.7% | |
| 1080 | 269 | 0.7% | |
| 8000 | 267 | 0.7% | |
| 2300 | 259 | 0.6% | |
| 2100 | 258 | 0.6% | |
| 2018 | 248 | 0.6% | |
| 8800 | 235 | 0.6% | |
| 8660 | 232 | 0.6% | |
| 9100 | 219 | 0.5% | |
| 4020 | 218 | 0.5% | |
| 8430 | 217 | 0.5% | |
| Other values (1032) | 31649 | 78.3% |
| Value | Count | Frequency (%) | |
| 1000 | 451 | 1.1% | |
| 1020 | 123 | 0.3% | |
| 1030 | 300 | 0.7% | |
| 1040 | 145 | 0.4% | |
| 1050 | 323 | 0.8% | |
| 1060 | 129 | 0.3% | |
| 1070 | 301 | 0.7% | |
| 1080 | 269 | 0.7% | |
| 1081 | 64 | 0.2% | |
| 1082 | 57 | 0.1% |
| Value | Count | Frequency (%) | |
| 9992 | 5 | < 0.1% | |
| 9991 | 14 | < 0.1% | |
| 9990 | 49 | 0.1% | |
| 9988 | 10 | < 0.1% | |
| 9982 | 2 | < 0.1% | |
| 9981 | 3 | < 0.1% | |
| 9980 | 4 | < 0.1% | |
| 9971 | 3 | < 0.1% | |
| 9970 | 6 | < 0.1% | |
| 9968 | 15 | < 0.1% |
| Distinct | 1046 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| Antwerpen | 886 |
|---|---|
| Knokke | 750 |
| Oostende | 685 |
| Gent | 634 |
| Uccle | 498 |
| Other values (1041) |
| Value | Count | Frequency (%) | |
| Antwerpen | 886 | 2.2% | |
| Knokke | 750 | 1.9% | |
| Oostende | 685 | 1.7% | |
| Gent | 634 | 1.6% | |
| Uccle | 498 | 1.2% | |
| Bruxelles | 451 | 1.1% | |
| Uitkerke | 447 | 1.1% | |
| Glain | 394 | 1.0% | |
| Wulpen | 365 | 0.9% | |
| Ixelles | 323 | 0.8% | |
| Deurne | 309 | 0.8% | |
| Anderlecht | 301 | 0.7% | |
| Schaerbeek | 300 | 0.7% | |
| Aalst | 289 | 0.7% | |
| Nieuwpoort | 285 | 0.7% | |
| Hasselt | 279 | 0.7% | |
| Molenbeek-Saint-Jean | 269 | 0.7% | |
| Brugge | 267 | 0.7% | |
| Turnhout | 259 | 0.6% | |
| Beveren | 258 | 0.6% | |
| De Panne | 232 | 0.6% | |
| Nieuwkerken-Waas | 219 | 0.5% | |
| Liège | 218 | 0.5% | |
| Middelkerke | 217 | 0.5% | |
| Renaix | 213 | 0.5% | |
| Other values (1021) | 31047 | 76.9% |
Unique
| Unique | 84 ? |
|---|---|
| Unique (%) | 0.2% |
Length
| Max length | 30 |
|---|---|
| Median length | 8 |
| Mean length | 8.565614556 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 63911 | 18.5% | |
| n | 26666 | 7.7% | |
| r | 23896 | 6.9% | |
| a | 18703 | 5.4% | |
| l | 18655 | 5.4% | |
| o | 16402 | 4.7% | |
| i | 16399 | 4.7% | |
| t | 15978 | 4.6% | |
| s | 14811 | 4.3% | |
| u | 11132 | 3.2% | |
| k | 8892 | 2.6% | |
| - | 8418 | 2.4% | |
| m | 7537 | 2.2% | |
| g | 6489 | 1.9% | |
| B | 6027 | 1.7% | |
| d | 5965 | 1.7% | |
| h | 5198 | 1.5% | |
| b | 4666 | 1.3% | |
| c | 4450 | 1.3% | |
| p | 4392 | 1.3% | |
| L | 4005 | 1.2% | |
| S | 3798 | 1.1% | |
| A | 3796 | 1.1% | |
| M | 3706 | 1.1% | |
| G | 3426 | 1.0% | |
| Other values (37) | 38690 | 11.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 287417 | 83.1% | |
| Uppercase Letter | 49294 | 14.2% | |
| Dash Punctuation | 8418 | 2.4% | |
| Space Separator | 517 | 0.1% | |
| Other Punctuation | 362 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 6027 | 12.2% | |
| L | 4005 | 8.1% | |
| S | 3798 | 7.7% | |
| A | 3796 | 7.7% | |
| M | 3706 | 7.5% | |
| G | 3426 | 7.0% | |
| H | 3343 | 6.8% | |
| W | 2892 | 5.9% | |
| E | 1953 | 4.0% | |
| K | 1952 | 4.0% | |
| O | 1816 | 3.7% | |
| D | 1723 | 3.5% | |
| N | 1489 | 3.0% | |
| T | 1341 | 2.7% | |
| P | 1252 | 2.5% | |
| R | 1132 | 2.3% | |
| C | 1023 | 2.1% | |
| U | 945 | 1.9% | |
| J | 944 | 1.9% | |
| F | 875 | 1.8% | |
| Z | 642 | 1.3% | |
| I | 609 | 1.2% | |
| V | 563 | 1.1% | |
| Q | 33 | 0.1% | |
| À | 6 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 63911 | 22.2% | |
| n | 26666 | 9.3% | |
| r | 23896 | 8.3% | |
| a | 18703 | 6.5% | |
| l | 18655 | 6.5% | |
| o | 16402 | 5.7% | |
| i | 16399 | 5.7% | |
| t | 15978 | 5.6% | |
| s | 14811 | 5.2% | |
| u | 11132 | 3.9% | |
| k | 8892 | 3.1% | |
| m | 7537 | 2.6% | |
| g | 6489 | 2.3% | |
| d | 5965 | 2.1% | |
| h | 5198 | 1.8% | |
| b | 4666 | 1.6% | |
| c | 4450 | 1.5% | |
| p | 4392 | 1.5% | |
| v | 3185 | 1.1% | |
| w | 2601 | 0.9% | |
| x | 1788 | 0.6% | |
| z | 1373 | 0.5% | |
| j | 1036 | 0.4% | |
| f | 835 | 0.3% | |
| y | 682 | 0.2% | |
| Other values (8) | 1775 | 0.6% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 8418 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 362 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 517 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 336711 | 97.3% | |
| Common | 9297 | 2.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 63911 | 19.0% | |
| n | 26666 | 7.9% | |
| r | 23896 | 7.1% | |
| a | 18703 | 5.6% | |
| l | 18655 | 5.5% | |
| o | 16402 | 4.9% | |
| i | 16399 | 4.9% | |
| t | 15978 | 4.7% | |
| s | 14811 | 4.4% | |
| u | 11132 | 3.3% | |
| k | 8892 | 2.6% | |
| m | 7537 | 2.2% | |
| g | 6489 | 1.9% | |
| B | 6027 | 1.8% | |
| d | 5965 | 1.8% | |
| h | 5198 | 1.5% | |
| b | 4666 | 1.4% | |
| c | 4450 | 1.3% | |
| p | 4392 | 1.3% | |
| L | 4005 | 1.2% | |
| S | 3798 | 1.1% | |
| A | 3796 | 1.1% | |
| M | 3706 | 1.1% | |
| G | 3426 | 1.0% | |
| H | 3343 | 1.0% | |
| Other values (34) | 34468 | 10.2% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 8418 | 90.5% | |
| 517 | 5.6% | ||
| ' | 362 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 344523 | 99.6% | |
| None | 1485 | 0.4% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 63911 | 18.6% | |
| n | 26666 | 7.7% | |
| r | 23896 | 6.9% | |
| a | 18703 | 5.4% | |
| l | 18655 | 5.4% | |
| o | 16402 | 4.8% | |
| i | 16399 | 4.8% | |
| t | 15978 | 4.6% | |
| s | 14811 | 4.3% | |
| u | 11132 | 3.2% | |
| k | 8892 | 2.6% | |
| - | 8418 | 2.4% | |
| m | 7537 | 2.2% | |
| g | 6489 | 1.9% | |
| B | 6027 | 1.7% | |
| d | 5965 | 1.7% | |
| h | 5198 | 1.5% | |
| b | 4666 | 1.4% | |
| c | 4450 | 1.3% | |
| p | 4392 | 1.3% | |
| L | 4005 | 1.2% | |
| S | 3798 | 1.1% | |
| A | 3796 | 1.1% | |
| M | 3706 | 1.1% | |
| G | 3426 | 1.0% | |
| Other values (29) | 37205 | 10.8% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| é | 681 | 45.9% | |
| è | 549 | 37.0% | |
| ê | 89 | 6.0% | |
| â | 74 | 5.0% | |
| ô | 67 | 4.5% | |
| ë | 10 | 0.7% | |
| à | 9 | 0.6% | |
| À | 6 | 0.4% |
type_of_property
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 21440 | 53.1% | |
| 1 | 18955 | 46.9% |
price
Real number (ℝ≥0)
| Distinct | 3517 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 314114.6616 |
|---|---|
| Minimum | 2500 |
| Maximum | 950000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 2500 |
|---|---|
| 5-th percentile | 120000 |
| Q1 | 199000 |
| median | 275000 |
| Q3 | 379000 |
| 95-th percentile | 680000 |
| Maximum | 950000 |
| Range | 947500 |
| Interquartile range (IQR) | 180000 |
Descriptive statistics
| Standard deviation | 168151.6724 |
|---|---|
| Coefficient of variation (CV) | 0.5353194006 |
| Kurtosis | 1.949629927 |
| Mean | 314114.6616 |
| Median Absolute Deviation (MAD) | 85000 |
| Skewness | 1.370488373 |
| Sum | 1.268866176e+10 |
| Variance | 2.827498492e+10 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 249000 | 556 | 1.4% | |
| 199000 | 551 | 1.4% | |
| 299000 | 545 | 1.3% | |
| 225000 | 521 | 1.3% | |
| 295000 | 520 | 1.3% | |
| 275000 | 517 | 1.3% | |
| 325000 | 428 | 1.1% | |
| 175000 | 417 | 1.0% | |
| 235000 | 415 | 1.0% | |
| 195000 | 411 | 1.0% | |
| 395000 | 409 | 1.0% | |
| 185000 | 402 | 1.0% | |
| 265000 | 399 | 1.0% | |
| 245000 | 389 | 1.0% | |
| 250000 | 387 | 1.0% | |
| 285000 | 371 | 0.9% | |
| 349000 | 369 | 0.9% | |
| 215000 | 354 | 0.9% | |
| 165000 | 338 | 0.8% | |
| 239000 | 335 | 0.8% | |
| 350000 | 327 | 0.8% | |
| 269000 | 326 | 0.8% | |
| 220000 | 314 | 0.8% | |
| 179000 | 313 | 0.8% | |
| 229000 | 312 | 0.8% | |
| Other values (3492) | 30169 | 74.7% |
| Value | Count | Frequency (%) | |
| 2500 | 3 | < 0.1% | |
| 6600 | 1 | < 0.1% | |
| 8160 | 1 | < 0.1% | |
| 9999 | 1 | < 0.1% | |
| 10000 | 4 | < 0.1% | |
| 11825 | 1 | < 0.1% | |
| 12500 | 1 | < 0.1% | |
| 14500 | 1 | < 0.1% | |
| 15000 | 6 | < 0.1% | |
| 19000 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 950000 | 70 | 0.2% | |
| 949000 | 8 | < 0.1% | |
| 948000 | 2 | < 0.1% | |
| 947000 | 3 | < 0.1% | |
| 945000 | 32 | 0.1% | |
| 940000 | 7 | < 0.1% | |
| 939000 | 1 | < 0.1% | |
| 936000 | 1 | < 0.1% | |
| 935000 | 3 | < 0.1% | |
| 930000 | 9 | < 0.1% |
number_of_rooms
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.813838346 |
|---|---|
| Minimum | 1 |
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 18 |
| Range | 17 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.26096777 |
|---|---|
| Coefficient of variation (CV) | 0.4481308502 |
| Kurtosis | 6.883403348 |
| Mean | 2.813838346 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.578245584 |
| Sum | 113665 |
| Variance | 1.590039718 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 13847 | 34.3% | |
| 3 | 13367 | 33.1% | |
| 4 | 5747 | 14.2% | |
| 1 | 4145 | 10.3% | |
| 5 | 2035 | 5.0% | |
| 6 | 789 | 2.0% | |
| 7 | 228 | 0.6% | |
| 8 | 104 | 0.3% | |
| 9 | 44 | 0.1% | |
| 10 | 42 | 0.1% | |
| 11 | 20 | < 0.1% | |
| 12 | 12 | < 0.1% | |
| 13 | 4 | < 0.1% | |
| 15 | 4 | < 0.1% | |
| 16 | 3 | < 0.1% | |
| 14 | 3 | < 0.1% | |
| 18 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 4145 | 10.3% | |
| 2 | 13847 | 34.3% | |
| 3 | 13367 | 33.1% | |
| 4 | 5747 | 14.2% | |
| 5 | 2035 | 5.0% | |
| 6 | 789 | 2.0% | |
| 7 | 228 | 0.6% | |
| 8 | 104 | 0.3% | |
| 9 | 44 | 0.1% | |
| 10 | 42 | 0.1% |
| Value | Count | Frequency (%) | |
| 18 | 1 | < 0.1% | |
| 16 | 3 | < 0.1% | |
| 15 | 4 | < 0.1% | |
| 14 | 3 | < 0.1% | |
| 13 | 4 | < 0.1% | |
| 12 | 12 | < 0.1% | |
| 11 | 20 | < 0.1% | |
| 10 | 42 | 0.1% | |
| 9 | 44 | 0.1% | |
| 8 | 104 | 0.3% |
house_area
Real number (ℝ≥0)
| Distinct | 657 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 152.4663201 |
|---|---|
| Minimum | 5 |
| Maximum | 3560 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 92 |
| median | 130 |
| Q3 | 184 |
| 95-th percentile | 324 |
| Maximum | 3560 |
| Range | 3555 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 95.64920638 |
|---|---|
| Coefficient of variation (CV) | 0.6273464614 |
| Kurtosis | 60.1319554 |
| Mean | 152.4663201 |
| Median Absolute Deviation (MAD) | 42 |
| Skewness | 4.041212374 |
| Sum | 6158877 |
| Variance | 9148.770682 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 90 | 892 | 2.2% | |
| 120 | 890 | 2.2% | |
| 100 | 876 | 2.2% | |
| 150 | 812 | 2.0% | |
| 140 | 746 | 1.8% | |
| 80 | 733 | 1.8% | |
| 110 | 700 | 1.7% | |
| 160 | 685 | 1.7% | |
| 200 | 683 | 1.7% | |
| 130 | 657 | 1.6% | |
| 85 | 648 | 1.6% | |
| 180 | 567 | 1.4% | |
| 70 | 550 | 1.4% | |
| 75 | 527 | 1.3% | |
| 95 | 517 | 1.3% | |
| 125 | 450 | 1.1% | |
| 170 | 446 | 1.1% | |
| 115 | 434 | 1.1% | |
| 105 | 409 | 1.0% | |
| 135 | 377 | 0.9% | |
| 220 | 343 | 0.8% | |
| 145 | 342 | 0.8% | |
| 60 | 341 | 0.8% | |
| 250 | 327 | 0.8% | |
| 65 | 318 | 0.8% | |
| Other values (632) | 26125 | 64.7% |
| Value | Count | Frequency (%) | |
| 5 | 3 | < 0.1% | |
| 11 | 1 | < 0.1% | |
| 13 | 2 | < 0.1% | |
| 14 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 16 | 5 | < 0.1% | |
| 17 | 6 | < 0.1% | |
| 18 | 22 | 0.1% | |
| 19 | 2 | < 0.1% | |
| 20 | 9 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3560 | 1 | < 0.1% | |
| 2019 | 1 | < 0.1% | |
| 1700 | 1 | < 0.1% | |
| 1640 | 1 | < 0.1% | |
| 1500 | 2 | < 0.1% | |
| 1461 | 1 | < 0.1% | |
| 1350 | 1 | < 0.1% | |
| 1339 | 1 | < 0.1% | |
| 1200 | 2 | < 0.1% | |
| 1121 | 1 | < 0.1% |
fully_equipped_kitchen
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 28176 | 69.8% | |
| 0 | 12219 | 30.2% |
open_fire
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 0 | |
|---|---|
| 1 | 2165 |
| Value | Count | Frequency (%) | |
| 0 | 38230 | 94.6% | |
| 1 | 2165 | 5.4% |
terrace
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 25052 | 62.0% | |
| 0 | 15343 | 38.0% |
garden
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 27419 | 67.9% | |
| 1 | 12976 | 32.1% |
| Distinct | 2952 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 545.8400792 |
|---|---|
| Minimum | 0 |
| Maximum | 400000 |
| Zeros | 20751 |
| Zeros (%) | 51.4% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 416 |
| 95-th percentile | 1840 |
| Maximum | 400000 |
| Range | 400000 |
| Interquartile range (IQR) | 416 |
Descriptive statistics
| Standard deviation | 3609.242736 |
|---|---|
| Coefficient of variation (CV) | 6.612271383 |
| Kurtosis | 4663.468703 |
| Mean | 545.8400792 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.15034165 |
| Sum | 22049210 |
| Variance | 13026633.12 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 20751 | 51.4% | |
| 150 | 169 | 0.4% | |
| 200 | 160 | 0.4% | |
| 1000 | 145 | 0.4% | |
| 300 | 144 | 0.4% | |
| 250 | 142 | 0.4% | |
| 100 | 138 | 0.3% | |
| 120 | 129 | 0.3% | |
| 400 | 120 | 0.3% | |
| 600 | 115 | 0.3% | |
| 180 | 111 | 0.3% | |
| 130 | 110 | 0.3% | |
| 210 | 108 | 0.3% | |
| 140 | 104 | 0.3% | |
| 90 | 102 | 0.3% | |
| 160 | 101 | 0.3% | |
| 110 | 97 | 0.2% | |
| 70 | 96 | 0.2% | |
| 170 | 95 | 0.2% | |
| 80 | 92 | 0.2% | |
| 500 | 92 | 0.2% | |
| 800 | 90 | 0.2% | |
| 60 | 84 | 0.2% | |
| 220 | 82 | 0.2% | |
| 240 | 80 | 0.2% | |
| Other values (2927) | 16938 | 41.9% |
| Value | Count | Frequency (%) | |
| 0 | 20751 | 51.4% | |
| 1 | 19 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 2 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 10 | 3 | < 0.1% | |
| 12 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 400000 | 1 | < 0.1% | |
| 264781 | 1 | < 0.1% | |
| 120300 | 1 | < 0.1% | |
| 120000 | 2 | < 0.1% | |
| 117800 | 1 | < 0.1% | |
| 99148 | 1 | < 0.1% | |
| 98822 | 1 | < 0.1% | |
| 88800 | 1 | < 0.1% | |
| 87600 | 1 | < 0.1% | |
| 86435 | 2 | < 0.1% |
number_of_facades
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 2 | |
|---|---|
| 0 | |
| 4 | |
| 3 |
| Value | Count | Frequency (%) | |
| 2 | 14531 | 36.0% | |
| 0 | 10360 | 25.6% | |
| 4 | 8104 | 20.1% | |
| 3 | 7400 | 18.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 14531 | 36.0% | |
| 0 | 10360 | 25.6% | |
| 4 | 8104 | 20.1% | |
| 3 | 7400 | 18.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 40395 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 14531 | 36.0% | |
| 0 | 10360 | 25.6% | |
| 4 | 8104 | 20.1% | |
| 3 | 7400 | 18.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 40395 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 14531 | 36.0% | |
| 0 | 10360 | 25.6% | |
| 4 | 8104 | 20.1% | |
| 3 | 7400 | 18.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 40395 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 14531 | 36.0% | |
| 0 | 10360 | 25.6% | |
| 4 | 8104 | 20.1% | |
| 3 | 7400 | 18.3% |
swimming_pool
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| 0 | |
|---|---|
| 1 | 696 |
| Value | Count | Frequency (%) | |
| 0 | 39699 | 98.3% | |
| 1 | 696 | 1.7% |
state_of_the_building
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| as new | |
|---|---|
| good | |
| unknown | |
| to be done up | |
| to renovate | |
| Other values (2) |
| Value | Count | Frequency (%) | |
| as new | 12096 | 29.9% | |
| good | 10985 | 27.2% | |
| unknown | 9796 | 24.3% | |
| to be done up | 2789 | 6.9% | |
| to renovate | 2441 | 6.0% | |
| just renovated | 2147 | 5.3% | |
| to restore | 141 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 6.923233073 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 48861 | 17.5% | |
| o | 44655 | 16.0% | |
| e | 27132 | 9.7% | |
| 25192 | 9.0% | ||
| w | 21892 | 7.8% | |
| a | 16684 | 6.0% | |
| d | 15921 | 5.7% | |
| u | 14732 | 5.3% | |
| s | 14384 | 5.1% | |
| t | 12247 | 4.4% | |
| g | 10985 | 3.9% | |
| k | 9796 | 3.5% | |
| r | 4870 | 1.7% | |
| v | 4588 | 1.6% | |
| b | 2789 | 1.0% | |
| p | 2789 | 1.0% | |
| j | 2147 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 254472 | 91.0% | |
| Space Separator | 25192 | 9.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 48861 | 19.2% | |
| o | 44655 | 17.5% | |
| e | 27132 | 10.7% | |
| w | 21892 | 8.6% | |
| a | 16684 | 6.6% | |
| d | 15921 | 6.3% | |
| u | 14732 | 5.8% | |
| s | 14384 | 5.7% | |
| t | 12247 | 4.8% | |
| g | 10985 | 4.3% | |
| k | 9796 | 3.8% | |
| r | 4870 | 1.9% | |
| v | 4588 | 1.8% | |
| b | 2789 | 1.1% | |
| p | 2789 | 1.1% | |
| j | 2147 | 0.8% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 25192 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 254472 | 91.0% | |
| Common | 25192 | 9.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 48861 | 19.2% | |
| o | 44655 | 17.5% | |
| e | 27132 | 10.7% | |
| w | 21892 | 8.6% | |
| a | 16684 | 6.6% | |
| d | 15921 | 6.3% | |
| u | 14732 | 5.8% | |
| s | 14384 | 5.7% | |
| t | 12247 | 4.8% | |
| g | 10985 | 4.3% | |
| k | 9796 | 3.8% | |
| r | 4870 | 1.9% | |
| v | 4588 | 1.8% | |
| b | 2789 | 1.1% | |
| p | 2789 | 1.1% | |
| j | 2147 | 0.8% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 25192 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 279664 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 48861 | 17.5% | |
| o | 44655 | 16.0% | |
| e | 27132 | 9.7% | |
| 25192 | 9.0% | ||
| w | 21892 | 7.8% | |
| a | 16684 | 6.0% | |
| d | 15921 | 5.7% | |
| u | 14732 | 5.3% | |
| s | 14384 | 5.1% | |
| t | 12247 | 4.4% | |
| g | 10985 | 3.9% | |
| k | 9796 | 3.5% | |
| r | 4870 | 1.7% | |
| v | 4588 | 1.6% | |
| b | 2789 | 1.0% | |
| p | 2789 | 1.0% | |
| j | 2147 | 0.8% |
lattitude
Real number (ℝ≥0)
| Distinct | 1051 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.313450246 |
|---|---|
| Minimum | 2.580669689 |
| Maximum | 6.3009381 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 2.580669689 |
|---|---|
| 5-th percentile | 2.9203275 |
| Q1 | 3.7141549 |
| median | 4.361194615 |
| Q3 | 4.849314652 |
| 95-th percentile | 5.622980506 |
| Maximum | 6.3009381 |
| Range | 3.720268411 |
| Interquartile range (IQR) | 1.135159752 |
Descriptive statistics
| Standard deviation | 0.8119890403 |
|---|---|
| Coefficient of variation (CV) | 0.1882458343 |
| Kurtosis | -0.6577956038 |
| Mean | 4.313450246 |
| Median Absolute Deviation (MAD) | 0.5635523147 |
| Skewness | -0.07418063984 |
| Sum | 174241.8227 |
| Variance | 0.6593262016 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4.3997081 | 886 | 2.2% | |
| 3.323373861 | 750 | 1.9% | |
| 2.9203275 | 685 | 1.7% | |
| 3.7141549 | 634 | 1.6% | |
| 4.3372348 | 498 | 1.2% | |
| 4.351697 | 451 | 1.1% | |
| 3.14048681 | 447 | 1.1% | |
| 5.541864 | 394 | 1.0% | |
| 2.707311916 | 365 | 0.9% | |
| 4.3815707 | 323 | 0.8% | |
| 4.3123401 | 301 | 0.7% | |
| 4.3737121 | 300 | 0.7% | |
| 4.03964242 | 289 | 0.7% | |
| 2.72839865 | 285 | 0.7% | |
| 5.336838397 | 279 | 0.7% | |
| 4.3227779 | 269 | 0.7% | |
| 3.2073611 | 267 | 0.7% | |
| 4.948461 | 259 | 0.6% | |
| 4.469525409 | 258 | 0.6% | |
| 3.14416 | 235 | 0.6% | |
| 2.580669689 | 232 | 0.6% | |
| 4.1780279 | 219 | 0.5% | |
| 5.5734203 | 218 | 0.5% | |
| 2.806340123 | 217 | 0.5% | |
| 3.6020465 | 213 | 0.5% | |
| Other values (1026) | 31121 | 77.0% |
| Value | Count | Frequency (%) | |
| 2.580669689 | 232 | 0.6% | |
| 2.6262588 | 7 | < 0.1% | |
| 2.64344877 | 42 | 0.1% | |
| 2.644911715 | 2 | < 0.1% | |
| 2.673321 | 7 | < 0.1% | |
| 2.707311916 | 365 | 0.9% | |
| 2.722259264 | 80 | 0.2% | |
| 2.72256881 | 5 | < 0.1% | |
| 2.72839865 | 285 | 0.7% | |
| 2.740961051 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6.3009381 | 2 | < 0.1% | |
| 6.2642498 | 1 | < 0.1% | |
| 6.257827 | 8 | < 0.1% | |
| 6.2053573 | 5 | < 0.1% | |
| 6.1884932 | 3 | < 0.1% | |
| 6.1651484 | 1 | < 0.1% | |
| 6.1258953 | 8 | < 0.1% | |
| 6.121679358 | 12 | < 0.1% | |
| 6.1117543 | 28 | 0.1% | |
| 6.1108404 | 5 | < 0.1% |
longitude
Real number (ℝ≥0)
| Distinct | 1051 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.85386439 |
|---|---|
| Minimum | 49.5085018 |
| Maximum | 51.4743516 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 315.7 KiB |
Quantile statistics
| Minimum | 49.5085018 |
|---|---|
| 5-th percentile | 50.3102184 |
| Q1 | 50.6701887 |
| median | 50.8704524 |
| Q3 | 51.1044854 |
| 95-th percentile | 51.2996935 |
| Maximum | 51.4743516 |
| Range | 1.9658498 |
| Interquartile range (IQR) | 0.4342967 |
Descriptive statistics
| Standard deviation | 0.3251105424 |
|---|---|
| Coefficient of variation (CV) | 0.006393035148 |
| Kurtosis | 1.466404874 |
| Mean | 50.85386439 |
| Median Absolute Deviation (MAD) | 0.2222474 |
| Skewness | -0.9593327094 |
| Sum | 2054241.852 |
| Variance | 0.1056968648 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 51.2211097 | 886 | 2.2% | |
| 51.34942965 | 750 | 1.9% | |
| 51.2303177 | 685 | 1.7% | |
| 51.0397129 | 634 | 1.6% | |
| 50.8018201 | 498 | 1.2% | |
| 50.8465573 | 451 | 1.1% | |
| 51.2996935 | 447 | 1.1% | |
| 50.648205 | 394 | 1.0% | |
| 51.09779175 | 365 | 0.9% | |
| 50.8222854 | 323 | 0.8% | |
| 50.8381411 | 301 | 0.7% | |
| 50.8676041 | 300 | 0.7% | |
| 50.9429755 | 289 | 0.7% | |
| 51.14416005 | 285 | 0.7% | |
| 50.930358 | 279 | 0.7% | |
| 50.8543551 | 269 | 0.7% | |
| 51.2147083 | 267 | 0.7% | |
| 51.3233812 | 259 | 0.6% | |
| 51.2115284 | 258 | 0.6% | |
| 50.9687312 | 235 | 0.6% | |
| 51.09437775 | 232 | 0.6% | |
| 51.1933908 | 219 | 0.5% | |
| 50.6451381 | 218 | 0.5% | |
| 51.18331695 | 217 | 0.5% | |
| 50.7476192 | 213 | 0.5% | |
| Other values (1026) | 31121 | 77.0% |
| Value | Count | Frequency (%) | |
| 49.5085018 | 10 | < 0.1% | |
| 49.5577562 | 11 | < 0.1% | |
| 49.5580794 | 14 | < 0.1% | |
| 49.5581925 | 2 | < 0.1% | |
| 49.5642065 | 16 | < 0.1% | |
| 49.5675296 | 18 | < 0.1% | |
| 49.5749479 | 6 | < 0.1% | |
| 49.5814209 | 32 | 0.1% | |
| 49.5903125 | 13 | < 0.1% | |
| 49.5969055 | 24 | 0.1% |
| Value | Count | Frequency (%) | |
| 51.4743516 | 34 | 0.1% | |
| 51.4677957 | 10 | < 0.1% | |
| 51.46092495 | 6 | < 0.1% | |
| 51.45063155 | 29 | 0.1% | |
| 51.43155825 | 4 | < 0.1% | |
| 51.41275043 | 4 | < 0.1% | |
| 51.41206885 | 21 | 0.1% | |
| 51.3994474 | 23 | 0.1% | |
| 51.3975366 | 1 | < 0.1% | |
| 51.39574085 | 19 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| Flandre-Occidentale | |
|---|---|
| Anvers | |
| Flandre-Orientale | |
| Hainaut | |
| Liège | |
| Other values (6) |
| Value | Count | Frequency (%) | |
| Flandre-Occidentale | 7235 | 17.9% | |
| Anvers | 5313 | 13.2% | |
| Flandre-Orientale | 5102 | 12.6% | |
| Hainaut | 4115 | 10.2% | |
| Liège | 3936 | 9.7% | |
| Bruxelles-Capitale | 3836 | 9.5% | |
| Brabant flamand | 3795 | 9.4% | |
| Limbourg | 2593 | 6.4% | |
| Brabant wallon | 1759 | 4.4% | |
| Namur | 1564 | 3.9% | |
| Luxembourg | 1147 | 2.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 12.25881916 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 62597 | 12.6% | |
| e | 58915 | 11.9% | |
| n | 45210 | 9.1% | |
| l | 43495 | 8.8% | |
| r | 37446 | 7.6% | |
| i | 26817 | 5.4% | |
| t | 25842 | 5.2% | |
| d | 23367 | 4.7% | |
| - | 16173 | 3.3% | |
| c | 14470 | 2.9% | |
| u | 14402 | 2.9% | |
| F | 12337 | 2.5% | |
| O | 12337 | 2.5% | |
| B | 9390 | 1.9% | |
| b | 9294 | 1.9% | |
| s | 9149 | 1.8% | |
| m | 9099 | 1.8% | |
| L | 7676 | 1.6% | |
| g | 7676 | 1.6% | |
| 5554 | 1.1% | ||
| o | 5499 | 1.1% | |
| A | 5313 | 1.1% | |
| v | 5313 | 1.1% | |
| x | 4983 | 1.0% | |
| H | 4115 | 0.8% | |
| Other values (6) | 18726 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 416900 | 84.2% | |
| Uppercase Letter | 56568 | 11.4% | |
| Dash Punctuation | 16173 | 3.3% | |
| Space Separator | 5554 | 1.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 12337 | 21.8% | |
| O | 12337 | 21.8% | |
| B | 9390 | 16.6% | |
| L | 7676 | 13.6% | |
| A | 5313 | 9.4% | |
| H | 4115 | 7.3% | |
| C | 3836 | 6.8% | |
| N | 1564 | 2.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 62597 | 15.0% | |
| e | 58915 | 14.1% | |
| n | 45210 | 10.8% | |
| l | 43495 | 10.4% | |
| r | 37446 | 9.0% | |
| i | 26817 | 6.4% | |
| t | 25842 | 6.2% | |
| d | 23367 | 5.6% | |
| c | 14470 | 3.5% | |
| u | 14402 | 3.5% | |
| b | 9294 | 2.2% | |
| s | 9149 | 2.2% | |
| m | 9099 | 2.2% | |
| g | 7676 | 1.8% | |
| o | 5499 | 1.3% | |
| v | 5313 | 1.3% | |
| x | 4983 | 1.2% | |
| è | 3936 | 0.9% | |
| p | 3836 | 0.9% | |
| f | 3795 | 0.9% | |
| w | 1759 | 0.4% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 16173 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 5554 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 473468 | 95.6% | |
| Common | 21727 | 4.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 62597 | 13.2% | |
| e | 58915 | 12.4% | |
| n | 45210 | 9.5% | |
| l | 43495 | 9.2% | |
| r | 37446 | 7.9% | |
| i | 26817 | 5.7% | |
| t | 25842 | 5.5% | |
| d | 23367 | 4.9% | |
| c | 14470 | 3.1% | |
| u | 14402 | 3.0% | |
| F | 12337 | 2.6% | |
| O | 12337 | 2.6% | |
| B | 9390 | 2.0% | |
| b | 9294 | 2.0% | |
| s | 9149 | 1.9% | |
| m | 9099 | 1.9% | |
| L | 7676 | 1.6% | |
| g | 7676 | 1.6% | |
| o | 5499 | 1.2% | |
| A | 5313 | 1.1% | |
| v | 5313 | 1.1% | |
| x | 4983 | 1.1% | |
| H | 4115 | 0.9% | |
| è | 3936 | 0.8% | |
| C | 3836 | 0.8% | |
| Other values (4) | 10954 | 2.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 16173 | 74.4% | |
| 5554 | 25.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 491259 | 99.2% | |
| None | 3936 | 0.8% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 62597 | 12.7% | |
| e | 58915 | 12.0% | |
| n | 45210 | 9.2% | |
| l | 43495 | 8.9% | |
| r | 37446 | 7.6% | |
| i | 26817 | 5.5% | |
| t | 25842 | 5.3% | |
| d | 23367 | 4.8% | |
| - | 16173 | 3.3% | |
| c | 14470 | 2.9% | |
| u | 14402 | 2.9% | |
| F | 12337 | 2.5% | |
| O | 12337 | 2.5% | |
| B | 9390 | 1.9% | |
| b | 9294 | 1.9% | |
| s | 9149 | 1.9% | |
| m | 9099 | 1.9% | |
| L | 7676 | 1.6% | |
| g | 7676 | 1.6% | |
| 5554 | 1.1% | ||
| o | 5499 | 1.1% | |
| A | 5313 | 1.1% | |
| v | 5313 | 1.1% | |
| x | 4983 | 1.0% | |
| H | 4115 | 0.8% | |
| Other values (5) | 14790 | 3.0% |
Most frequent None characters
| Value | Count | Frequency (%) | |
| è | 3936 | 100.0% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 315.7 KiB |
| Flandre | |
|---|---|
| Wallonie | |
| Bruxelles |
| Value | Count | Frequency (%) | |
| Flandre | 24038 | 59.5% | |
| Wallonie | 12521 | 31.0% | |
| Bruxelles | 3836 | 9.5% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.4998886 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 56752 | 18.7% | |
| e | 44231 | 14.6% | |
| a | 36559 | 12.1% | |
| n | 36559 | 12.1% | |
| r | 27874 | 9.2% | |
| F | 24038 | 7.9% | |
| d | 24038 | 7.9% | |
| W | 12521 | 4.1% | |
| o | 12521 | 4.1% | |
| i | 12521 | 4.1% | |
| B | 3836 | 1.3% | |
| u | 3836 | 1.3% | |
| x | 3836 | 1.3% | |
| s | 3836 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 262563 | 86.7% | |
| Uppercase Letter | 40395 | 13.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 24038 | 59.5% | |
| W | 12521 | 31.0% | |
| B | 3836 | 9.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 56752 | 21.6% | |
| e | 44231 | 16.8% | |
| a | 36559 | 13.9% | |
| n | 36559 | 13.9% | |
| r | 27874 | 10.6% | |
| d | 24038 | 9.2% | |
| o | 12521 | 4.8% | |
| i | 12521 | 4.8% | |
| u | 3836 | 1.5% | |
| x | 3836 | 1.5% | |
| s | 3836 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 302958 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 56752 | 18.7% | |
| e | 44231 | 14.6% | |
| a | 36559 | 12.1% | |
| n | 36559 | 12.1% | |
| r | 27874 | 9.2% | |
| F | 24038 | 7.9% | |
| d | 24038 | 7.9% | |
| W | 12521 | 4.1% | |
| o | 12521 | 4.1% | |
| i | 12521 | 4.1% | |
| B | 3836 | 1.3% | |
| u | 3836 | 1.3% | |
| x | 3836 | 1.3% | |
| s | 3836 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 302958 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 56752 | 18.7% | |
| e | 44231 | 14.6% | |
| a | 36559 | 12.1% | |
| n | 36559 | 12.1% | |
| r | 27874 | 9.2% | |
| F | 24038 | 7.9% | |
| d | 24038 | 7.9% | |
| W | 12521 | 4.1% | |
| o | 12521 | 4.1% | |
| i | 12521 | 4.1% | |
| B | 3836 | 1.3% | |
| u | 3836 | 1.3% | |
| x | 3836 | 1.3% | |
| s | 3836 | 1.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | postal_code | city_name | type_of_property | price | number_of_rooms | house_area | fully_equipped_kitchen | open_fire | terrace | garden | surface_of_the_land | number_of_facades | swimming_pool | state_of_the_building | lattitude | longitude | province | region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1050 | Ixelles | 0 | 340000 | 6 | 203 | 1 | 0 | 1 | 0 | 95 | 2 | 0 | to be done up | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 1 | 1 | 1050 | Ixelles | 0 | 520000 | 4 | 200 | 0 | 0 | 0 | 0 | 69 | 2 | 0 | to renovate | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 2 | 3 | 1050 | Ixelles | 0 | 599000 | 4 | 160 | 1 | 0 | 1 | 1 | 100 | 2 | 0 | to be done up | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 3 | 4 | 1050 | Ixelles | 0 | 599000 | 3 | 160 | 1 | 0 | 1 | 1 | 130 | 2 | 0 | good | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 4 | 5 | 1050 | Ixelles | 0 | 575000 | 3 | 171 | 0 | 0 | 0 | 0 | 46 | 2 | 0 | just renovated | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 5 | 6 | 1050 | Ixelles | 0 | 590000 | 4 | 225 | 0 | 0 | 1 | 0 | 0 | 2 | 0 | to renovate | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 6 | 7 | 1050 | Ixelles | 0 | 575000 | 4 | 209 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | unknown | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 7 | 8 | 1050 | Ixelles | 0 | 595000 | 1 | 195 | 1 | 1 | 1 | 1 | 617 | 4 | 0 | as new | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 8 | 9 | 1050 | Ixelles | 0 | 595777 | 4 | 250 | 0 | 0 | 0 | 0 | 70 | 2 | 0 | unknown | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
| 9 | 11 | 1050 | Ixelles | 0 | 650000 | 6 | 250 | 1 | 0 | 0 | 0 | 60 | 2 | 0 | good | 4.381571 | 50.822285 | Bruxelles-Capitale | Bruxelles |
Last rows
| df_index | postal_code | city_name | type_of_property | price | number_of_rooms | house_area | fully_equipped_kitchen | open_fire | terrace | garden | surface_of_the_land | number_of_facades | swimming_pool | state_of_the_building | lattitude | longitude | province | region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 40385 | 52063 | 4342 | Hognoul | 0 | 399000 | 4 | 180 | 1 | 1 | 1 | 1 | 680 | 3 | 0 | as new | 5.455639 | 50.680810 | Liège | Wallonie |
| 40386 | 52064 | 4342 | Hognoul | 0 | 425000 | 3 | 315 | 1 | 0 | 1 | 1 | 0 | 3 | 0 | unknown | 5.455639 | 50.680810 | Liège | Wallonie |
| 40387 | 52065 | 7743 | Obigies | 0 | 390000 | 4 | 340 | 1 | 1 | 0 | 1 | 2164 | 4 | 0 | unknown | 3.364281 | 50.662055 | Hainaut | Wallonie |
| 40388 | 52067 | 3050 | Oud-Heverlee | 0 | 420000 | 5 | 185 | 0 | 0 | 0 | 1 | 465 | 0 | 0 | to be done up | 4.667897 | 50.821768 | Brabant flamand | Flandre |
| 40389 | 52068 | 3050 | Oud-Heverlee | 0 | 435000 | 4 | 234 | 1 | 0 | 1 | 0 | 0 | 3 | 0 | as new | 4.667897 | 50.821768 | Brabant flamand | Flandre |
| 40390 | 52070 | 1472 | Vieux-Genappe | 0 | 475000 | 5 | 216 | 1 | 1 | 0 | 0 | 1550 | 4 | 1 | as new | 4.401503 | 50.629025 | Brabant wallon | Wallonie |
| 40391 | 52071 | 1472 | Vieux-Genappe | 0 | 475000 | 5 | 215 | 1 | 0 | 1 | 0 | 1550 | 0 | 1 | good | 4.401503 | 50.629025 | Brabant wallon | Wallonie |
| 40392 | 52072 | 1461 | Haut-Ittre | 0 | 499000 | 5 | 275 | 1 | 0 | 1 | 1 | 1561 | 4 | 0 | unknown | 4.296472 | 50.648804 | Brabant wallon | Wallonie |
| 40393 | 52073 | 1761 | Borchtlombeek | 0 | 495000 | 4 | 235 | 1 | 0 | 0 | 1 | 488 | 4 | 0 | unknown | 4.136915 | 50.848178 | Brabant flamand | Flandre |
| 40394 | 52075 | 3381 | Kapellen | 0 | 485000 | 3 | 220 | 0 | 0 | 1 | 0 | 1019 | 4 | 0 | good | 4.960878 | 50.887345 | Brabant flamand | Flandre |